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^ ■ ABSTRACT 

CU Early-type galaxies (ETGs) are observed to be more compact at z > 2 than in the 

local Universe. Remarkably, much of this size evolution appears to take place in a 
short ~ 1.8 Gyr time span between z ~ 2.2 and z ~ 1.3, which poses a serious 
challenge to hierarchical galaxy formation models where mergers occurring on a similar 
timescale are the main mechanism for galaxy growth. We compute the merger-driven 
rcdshift evolution of stellar mass M* oc (1 + z) aM , half-mass radius R c cx (1 + z) an 
and velocity-dispersion <jq oc (1 + z) a " predicted by concordance A cold dark matter 
for a typical massive ETG in the redshift range z ~ 1.3 — 2.2. Neglecting dissipative 
processes, and thus maximizing evolution in surface density, we find — 1.5<a/i/ < — 
0.6, — 1.9<afl< — 0.7 and 0.06 < a a < 0.22, under the assumption that the accreted 
satellites arc spheroids. It follows that the predicted z ~ 2.2 progenitors of z ~ 1.3 
ETGs are significantly less compact (on average a factor of ~ 2 larger R c at given 
M„) than the quiescent galaxies observed at z > 2. Furthermore, we find that the 
scatter introduced in the size-mass correlation by the predicted merger-driven growth 
is difficult to reconcile with the tightness of the observed scaling law. We conclude 
that - barring unknown systcmatics or selection biases in the current measurements 
- minor and major mergers with spheroids are not sufficient to explain the observed 
size growth of ETGs within the standard model. 

Key words: galaxies: elliptical and lenticular, cD — galaxies: formation — galaxies: 
kinematics and dynamics — galaxies: structure — galaxies: evolution 



1 INTRODUCTION 

Photometric and spectroscopic observations of high-redshift 
(z > 2) early-type galaxies (ETGs) suggest that these ob- 
jects may be remarkably more compact (e.g. Stiavelli et al. 
1999; Daddi et al. 2005; Trujillo et al. 2006; Zirm et al. 2007; 
Cimatti et al. 2008; van der Wei et al. 2008; van Dokkum 
et al. 2008; Saracco, Longhetti, & Andreon 2009; Cassata et 
al. 2011; Cimatti, Nipoti, & Cassata 2012; Damjanov et al. 
2011; Saracco, Longhetti, & Gargiulo 2011) and have higher 
velocity dispersion (Cenarro & Trujillo 2009; Cappellari et 
al. 2009; van Dokkum, Kriek, & Franx 2009; van de Sande 
et al. 2011) than their local counterparts. 

In the past few years, much theoretical work has been 
devoted to explaining the size evolution of massive ETGs 
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since z > 2. Dissipative effects, such as star formation and 
gas accretion, are expected to go in the opposite direction 
and increase galaxy stellar density (Robertson et al. 2006; 
Ciotti, Lanzoni, & Volonteri 2007; Covington et al. 2011). 
Therefore attention has focused on dissipationless ("dry") 
mergers, which appear to be the most promising mecha- 
nism to reproduce the observed evolutionary trends. Even 
though some groups have been able to reproduce the ob- 
served mean evolution by considering the combined effects of 
dry major and minor mergers, a potential contribution from 
active galactic nuclei (AGN; Fan et al. 2008, 2010; Ragone- 
Figueroa & Granato 2011), as well as a number of subtle ob- 
servational issues (Hopkins et al. 2010a; Mancini et al. 2010; 
Oser et al. 2012), it is clear that the tension is far from re- 
solved. Reproducing the average trend is only the first step. 
A successful model needs to also reproduce under the same 
assumption other properties of the mass-size/ velocity dis- 
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persion correlations, including environmental dependencies 
(Cooper et al. 2012; Shankar et al. 2011) and their tightness 
(Nipoti, Treu, & Bolton 2009a, hereafter N09a; Nipoti et al. 
2009b, hereafter N09b; Bernardi et al. 2011; Nair, van den 
Bergh, & Abraham 2011). 

The results of Newman et al. (2010, hereafter N10) fur- 
ther raise the stakes of the theoretical challenge. Bridging 
the gap between the local universe and z > 2, they found 
that ETGs at z ~ 1.3 are only moderately smaller in size 
than present-day ETGs at fixed velocity dispersion. To- 
gether with results at higher redshifts, this suggests that 
ETGs have evolved at a very rapid pace between z ~ 2.2 
and z ~ 1.3, followed by more gentle evolution until the 
present day (see also Cimatti, Nipoti, & Cassata 2012; Rai- 
choor et al. 2012; Newman et al. 2012, hereafter N12). These 
findings are confirmed and extended by the analysis of deep 
Cosmic Assembly Near-infrared Deep Extragalactic Legacy 
Survey (CANDELS) images which show that the observed 
visible satellites cannot account for the evolution in size and 
number density of massive ETGs by minor merging (N12, 
see also Bluck et al. 2012). 

Whereas most theoretical papers so far have focused on 
the entire evolutionary baseline z > 2 to the present, in this 
paper we focus on the shorter time span between z ~ 2.2 
and z ~ 1.3. This short timescale allows us to follow up 
a simple yet powerful and conservative approach. We start 
from two well-defined samples at z ~ 1.3, evolve them back 
in time to z ~ 2.2 and compare them to observational sam- 
ples at this higher redshift. In order to maximize the size 
evolution we neglect all dissipative processes, assuming that 
galaxies grow only by dry mergers. In other words, for given 
stellar-mass growth rate our models predict the maximum 
possible growth in size. Stellar mass could grow more than 
predicted by our models (as conversion of gas into stars is 
not accounted for), but, as mentioned above, this process is 
believed to have the effect of making galaxies more compact. 
In this sense our model is extreme: if it fails to reproduce 
the observed growth, then additional physical processes (e.g. 
feedback from AGN) or perhaps unknown selection effects 
must be considered in order to hope to reconcile the hier- 
archical model with the data. However, our dissipationless 
evolution model is also realistic in the sense that we adopt 
major and minor mergers rates and parameters taken from 
A cold dark matter (ACDM) cosmological simulations. We 
then used detailed iV-body simulations of individual merg- 
ers to compute the consequences of the mergers on galaxy 
structure and make robust predictions of their evolution in 
size, dark and luminous mass, and stellar velocity disper- 
sion. For simplicity we limit ourselves to mergers between 
spheroidal systems. Our approach combines the benefits of 
detailed numerical simulations of individual merger events 
with the required knowledge of merging parameters that 
can only be gathered from large-volume cosmological sim- 
ulations (for the dissipative case see Robertson et al. 2006; 
Hopkins et al. 2009). This paper supercedes our previous 
work (N09a; N09b) based on individual iV-body simulations 
in idealized merging conditions. 

Our reference data consist of two well-defined samples 
of ETGs: the first sample consists of galaxies with measured 
stellar velocity dispersion, size, and stellar mass. The sec- 
ond sample consists of galaxies with measured size and stel- 
lar mass, but not necessarily velocity dispersion. The first 



sample is in principle cleaner to interpret, since stellar ve- 
locity dispersion is changed relatively little by dry merg- 
ers (Hausman & Ostriker 1978; Hernquist, Spergel, & Heyl 
1993; Nipoti, Londrillo, & Ciotti 2003; Naab, Johansson, & 
Ostriker 2009) and therefore provides an excellent "label" to 
match samples at different redshifts. At the moment, there 
are only a handful of measurements of velocity dispersion at 
z > 1.8. Hence, the statistical power of this diagnostic is cur- 
rently limited. However, these calculations provide a useful 
benchmark and framework for interpreting the larger sam- 
ples that are expected to be collected in the near future us- 
ing multiplexed infrared spectrographs on large telescopes. 
The second sample is an order of magnitude larger in size, 
and currently provides the most stringent test of the galaxy- 
evolution models presented here. 

The manuscript is organized as follows. In Section 2 we 
summarize the properties of our comparison samples. In Sec- 
tion 3 we describe our models based on three ingredients: 
i) mergers and mass accretion rates inferred from cosmo- 
logical numerical simulations; ii) simple recipes to connect 
halo and stellar mass based on abundance matching tech- 
niques; iii) prescriptions for evolution of velocity dispersion 
and size based on individual merger iV-body simulations. As 
it turns out, the major source of theoretical uncertainty is 
related to the second step, i.e. matching stellar with halo 
mass. To quantify this uncertainty, we consider three inde- 
pendent recipes and we show that our conclusions are robust 
with respect to this choice. In Section 4 we compare our nu- 
merical predictions to the data. In Section 5 we perform a 
consistency check of our models by comparing the descen- 
dants of the 2 ~ 1.3 samples with the local scaling relations. 
The results are discussed in Section 6, and in Section 7 we 
draw our conclusions. 

Throughout the paper we assume Ho — 
73kms _1 Mpc -1 , J7a = 0.75 and fi m = 0.25, consis- 
tent with the values adopted in the Millenium I and 
II simulations (Springel et al. 2005; Boylan-Kolchin et 
al. 2009). We also adopt a Chabrier (2003) initial mass 
function (IMF). When necessary we transform published 
values of stellar mass to a Chabrier IMF, using appropriate 
renormalization factors. We note that our results are 
independent of the specific choice of the IMF, provided that 
the same IMF is used consistently to estimate stellar masses 
of observed galaxies and to connect observed properties 
with dark matter halos. 



2 OBSERVATIONAL DATA 
2.1 Early-type galaxies at z ~ 1.3 

Our first reference sample at 2 ~ 1.3 is comprised of 
spheroidal galaxies in the redshift interval 1 < z < 1.6 
observed by N10. Following N10 we consider only the 
sub-sample of galaxies with central stellar velocity dis- 
persion (Jo > 200kms _1 , which is estimated to be com- 
plete at the 90% level. This sample (hereafter VI; see Ta- 
ble 1) consists of 13 ETGs with stellar mass in the range 
10.5 < log A^/Mq < 11.3, with average redshift (z) ~ 1.3. 

Our second reference sample, without stellar velocity 
dispersion measures, consists of quiescent ETGs in the red- 
shift range 1 < 2 < 1.6 ((2) ~ 1.3) selected from the sample 
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Table 1. Properties of the samples of observed galaxies. 



Sample 


Range of z 


(*> 






References 


VI 


1 < z < 1.6 


1.3 


13 


yes 


1 


HI 


1 < z < 1.6 


1.3 


150 


no 


2 


V2 


1.8 < z < 2.2 


1.9 


4 


yes 


3, 4, 5, 6 


R2 


2 < z < 2.6 


2.2 


53 


no 


2, 4, 7, 8 



iVgaj: number of galaxies. (To = y e s(no): measures of are (are 
not) available. References: 1=N10, 2=N12, 3= Cappcllari et al. 
(2009), 4=van Dokkum, Krick, & Franx (2009), 5=Onodera et 
al. (2010), 6=van de Sande et al. (2011), 7=van Dokkum et al. 
(2008), 8=Kriek et al. (2008). 

of N12. This sample (hereafter Rl, see Table 1) comprises 
150 galaxies with measures of R e and stellar mass complete 
above M* > 10 104 M Q . 



2.2 Early-type galaxies at z ~ 2.2 

At z > 1.8, there are only a handful of ETGs with measured 
stellar velocity dispersion. Thus our first comparison sam- 
ple (hereafter V2, see Table 1) consists of 4 galaxies taken 
from the studies of Cappellari et al. (2009), van Dokkum, 
Kriek, & Franx (2009), Onodera et al. (2010, upper limit 
on co) and van de Sande et al. (2011). The average redshift 
of sample V2 is (z) — 1.9. Substantially larger is our sec- 
ond comparison sample, comprised of ETGs in the redshift 
range 2 < z < 2.6 with measured stellar mass and effec- 
tive radius. We construct this sample (hereafter R2, see Ta- 
ble 1) by selecting quiescent galaxies with M* > 1O 1O ' 4 M0 
from the studies by van Dokkum et al. (2008), Kriek et al. 
(2008), van Dokkum, Kriek, & Franx (2009) and N12. This 
results in a sample of 53 ETGs with properties very simi- 
lar to those of our sample of ETGs at z ~ 1.3, well-suited 
for a detailed comparison. Note that we use the term ETGs 
in a broad sense, including both morphologically selected 
spheroids and quiescent galaxies. The average redshift of 
sample R2 is (z) ~ 2.2, which we adopt as reference redshift 
when comparing models with observations. 



3 MODELS 

In this Section we describe how we compute the predicted 
properties of higher- z progenitors of our samples of galaxies 
at z ~ 1.3. For each galaxy, we need to compute evolution in 
stellar mass, effective radius, and stellar velocity dispersion, 
driven by the evolution of its dark matter (DM) halo mass 
Mh as predicted by cosmological iV-body simulations. 

The growth of stellar mass M* with z can be written in 
terms of dMh/dz as 



dM, dM* dM h 



(1) 



dz dM h dz 

In turn, the evolution of the central stellar velocity disper 
sion o"o is given by 

d<7 _ dcro dM, _ da dM* dM h 
~~dz~ ~ dTL~dT ~ dM* dM h dz ' 

while the evolution of the effective radius R c is given by 



(2) 



dR 

dz 



dR^d^L _ dR c dM* dM h 
dMl~dz~ ~ 



dM* dM h dz ' ^ 

Therefore, the key ingredients of our model are the four 
derivatives dM h /dz, dM t /dM h , da /dM* and di? e /dM*. 
Sections 3.1 to 3.3 describe in detail how these derivatives 
are calculated based on up-to-date cosmological iV-body 
simulations, abundance matching results, and detailed sim- 
ulations of individual merger events. Section 3.4 combines 
all the ingredients to compute the evolution of individual 
galaxies. 



3.1 Halo mass growth rate (dM-^/dz) 
3.1.1 Total mass growth rate 

Based on the Millenium I and II simulations, Fakhouri, Ma, 
& Boylan-Kolchin (2010) estimate the halo mass growth rate 
as follows. The average mass variation with redshift of a DM 
halo of mass Mh is 

b-l 

(4) 



d In M h 



Mo 



1 + az 



dz 



W 12 M G H 1 + z \ 10 12 M< 



M h 



with Mo = 46.1M0 / yr, a = 1.11 and b= 1.1. By integrating 
equation (4) between z& (the redshift of the descendant halo) 
and z we obtain 



" M h (z) ' 


l-b 


' M b (z d )~ 


1O 12 M 




1O 12 M 



M 



where 



1 +az' 
1 + z' 



dz' 



Ho W 12 M Q ' 



a(z — Zd) — (a — 1) In 



(5) 



1 + z 



l + z d 



.(6) 



This formalism can be used to quantify the growth rate of 
the halo of our descendant galaxies. The total accreted DM 
fraction <5Mh(z)/Mh(zd) is shown in Fig. 1 for a representa- 
tive descendant halo at z d = 1.3 with Mh(zd) = 5 x 10 12 Mq. 
Note that the estimate of Fakhouri, Ma, & Boylan-Kolchin 
(2010) is appropriate for main halos, not for sub-halos. How- 
ever, the large majority (~ 80%) of massive (M» ~ 10 1 Mq) 
red galaxies are central galaxies of halos (van den Bosch et 
al. 2008) even in the local universe. Therefore, we can sim- 
plify our treatment by assuming that our samples of massive 
ETGs consist of central halo galaxies (see also van der Wei 
et al. 2009). 



3.1.2 Mass growth rate due to mergers only 

The total growth rate shown in Figure 1 includes the con- 
tribution of mergers with other halos as well as accretion 
of diffuse DM (Fakhouri & Ma 2010; Genel et al. 2010). For 
our purposes, it is important to distinguish the two contribu- 
tions, because — as discussed below — we expect no substan- 
tial growth in stellar mass associated with diffuse accretion 
of DM 1 . 

The merger rate is expected to depend on the mass of 
the main halo Mh, on the redshift z, on the mass ratio £ 

1 Of course it is possible that the so-called "cold-flow" accre- 
tion of baryons is important in galaxy evolution (e.g. Keres et al. 
2005), but this is expected to be accretion of gaseous baryons, 
which wc can neglect in our pure dry- merger model. 
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Figure 1. The solid curve represents the total fraction of DM 
mass accreted between a and Zd = 1.3 of a descendant halo with 
mass Mh(Zft) = 5 X 10 12 Mq at z d = 1.3 as a result of mergers 
and diffuse accretion: SM^(z) = M^(z^) — Mj,(z). The dotted 
lines represent the fraction of DM mass accreted between z and 
Zd = 1.3, as a result of mergers only with mass-ratio £ > £ m i n . 
The curves are based on the analysis of the Millenium I and II 
simulations by Fakhouri, Ma, & Boylan-Kolchin (2010). 



between the satellite and the main halo, and on the merger 
orbital parameters (e.g., orbital energy E and orbital an- 
gular momentum L). Omitting for simplicity the explicit 
dependence on E and L, the halo evolution due to mergers 
can be written as 



d 2 M h 
dzd£ 



d 2 N n 



dzd£ 



'-(M h ,ti,z), 



(7) 



where £ < 1 is the mass ratio of the two DM halos involved 
in the merger, and d 2 Nm eTg /dzd£ is the distribution in z and 
£ of the number of mergers per halo. The mass accretion rate 
due to mergers with mass ratio higher than £ m i n is therefore 
given by 



dM h 



dz 



M h (z)£- 



dzd£ 



= d£. 



(8) 



Based on the Millenium I and II simulations, Fakhouri, Ma, 
& Boylan-Kolchin (2010) estimate 



d 2 iV„ 



d£dz 
implying 
dM h 



M h 



10 12 M F 



f exp 



(1+2)" -(9) 



1O 12 M 
where 



-Ah 



t,' exp 



M h (z) 



10 12 M e 



(l + z)*dz, (10) 



(11) 



Following Fakhouri, Ma, & Boylan-Kolchin (2010), we as- 
sume A = 0.0104, i = 9.72 x 10" 3 , a = 0.133, /3 = -1.995, 
7 = 0.263 and n' = 0.0993. By integrating equation (10) we 
get 



[SM h ] mclg (z) 
1O 12 M 



Alt 



r 


' M h (z') ' 




1O 12 M 



a + l 



(i+ z y dz',(i2) 



which is the DM mass accreted between 2 and Zd via mergers 
with mass ratio £ > £ m i n . This quantity, normalized to the 
total DM mass of the halo at z = Zd, is plotted in Fig. 1 for 
a representative halo of mass Mh = 5 x 10 12 Mq at redshift 
Zd = 1.3, for a range of values of f; m i n . The plot shows that 
the most massive 2 ~ 2.2 progenitor of a typical z — 1.3 
halo is roughly half as massive as the descendant. However, 
only ~ 1/3 of the mass of the descendant has been acquired 
via mergers (defined as £ > 0.04; Fakhouri & Ma 2010). The 
rest is acquired by diffuse accretion. 

We note that the Millenium simulations, which we 
use to quantify merger rates, adopt a normalization of the 
mass variance as = 0.9, while the latest (7-year) analysis 
of the Wilkinson Microwave Anisotropy Probe experiment 
(WMAP7) favours a 8 ~ 0.8 (Komatsu et al. 2011). Though 
rescaling the numerical results to a different cosmology is 
not trivial (Angulo & White 2010), according to the ana- 
lytic approach of Lacey & Cole (1993) the merger rates for 
as ~ 0.8 can be at most ~ 10% higher than for the Mil- 
lennium choice. Changing the merger rates by this amount 
would not alter any of our conclusions. Detailed estimates 
of the merger rates in a WMAP7 universe will be avail- 
able in the near future from the analysis of recent TV-body 
simulations with updated cosmology (such as the Bolshoi 
Simulation; Klypin, Trujillo-Gomez, & Primack 2011). 



3.1.3 Minimum, merger mass-ratio £ m i n 

Not all DM accretion events contribute to the stellar-mass 
growth. In particular, very minor mergers are not expected 
to contribute significantly, because (i) their merging time 
can be extremely long (longer than the Hubble time) and 
(ii) only a very small fraction of their mass is in stars. For 
these reasons, only mergers with mass ratio larger than a 
critical value £ m - m will be relevant to the growth of the stellar 
component of the galaxy. 

The critical value of the satellite-to-main halo mass 
ratio £ m in can be identified on the basis of the merg- 
ing timescales (see Hopkins et al. 2010b and references 
therein). Here we adopt the results of Boylan-Kolchin, Ma, 
& Quataert (2008), who, based on iV-body simulations, es- 
timated the relationship between merging time i me rg of a 
satellite and dynamical time tdyn of the host halo. Boylan- 
Kolchin, Ma, & Quataert (2008) parametrize the orbits of 
the infalling satellites using circularity n = yl-? (where 
e is the eccentricity) and r c i rc (E)/r v i T , the radius of a cir- 
cular orbit with the same energy E as the actual orbit (or- 
bits characterized by larger values of r C uc(E)/r V ir are less 
bound). The merging timescale t melg as a function of mass 
ratio £, is then given by 

:c(E) 1C 



— exp (en) 



(13) 



tdyn In (l + |) 

with a' = 0.216, b' = 1.3, d = 1.9 and d! = 1.0 (Boylan 
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Figure 2. Merging time (in units of the main-halo dynamical 
time) as a function of the satellite-to-main halo mass ratio £ for 
fcireO^OAVir = 1 (thin red) and ?'circ(^')/ r vir = 0.65 (thick blue), 
for different values of the circularity r\ = 0.3 (dotted), n = 0.5 
(solid) and rj = 1 (dashed; Boylan-Kolchin, Ma, & Quataert 
2008). Only accretion events with merging time below the hori- 
zontal long-dashed line can contribute to the growth of the stellar 
mass of the central galaxy in the redshift range 1.3 — 2.2. 



Kolchin, Ma, & Quataert 2008). Equation (13) has been esti- 
mated for bound orbits (with orbit parameters measured at 
r v i r ), with £ in the range 0.025 < £ < 0.3. The halo dynamical 
time tdyn is denned as 



tdy 



\GM b 



1/2 



(14) 



where r v j r is the virial radius and Mh the mass of the main 
halo. It follows that idyn = (2/A) 1/2 H~ 1 , because, by def- 
inition, r^ iT = 2GM h /AH 2 , where H(z) is the Hubble pa- 
rameter at redshift z. So, for A = 200, tdyn = O.l-ff -1 inde- 
pendent of mass (Boylan-Kolchin, Ma, & Quataert 2008). 

As a result, the time lag t mei& between the time when 
the satellites enters the virial radius of the halo, and the 
moment when the satellites is accreted by the central galaxy 
depends on £ and z, but is independent of the halo mass. 
In this analysis, given the limited redshift interval, we can 
safely adopt a fixed value of £ m in- The smallest value of 
tu = H(z)^ 1 in the redshift range z = 1.3 — 2.2 is fir, mm = 
tn(z = 2.2) ~ 1.4 Gyr. The cosmic time between z = 2.2 
and z = 1.3 is ~ 1.8 Gyr ~ l-3tn,mm- Therefore, we assume 
that only mergers with t mcrg < 13t dyn (i.e. t morg < 1.3£h(z)) 
can contribute to the growth of the stellar component of 
the galaxy. Note that this approach is conservative, since 
our merging criterion £ morg ^ 1.3£h(z) gives an upper limit 
to the mass accreted via mergers by the descendant galaxy. 

In Fig. 2 we plot £ me rg/£dyn as a function of £ for 
different combinations of the values of the parameters r\ 
and r c i IC (E), spanning the entire range explored by Boylan- 
Kolchin, Ma, & Quataert (2008): r circ (S)/r v i r = 0.65, 1 and 



77 = 0.3,0.5,1. The critical ratio £ m in (denned such that 
tmcrg = 13tdyn = 1.34h) is in the range 0.02 < £ m in < 0.09. 
We can refine our estimate of £ m i n based on the distribu- 
tion of orbital parameters of infalling DM satellites in cos- 
mological iV-body simulations (Benson 2005; Wang et al. 
2005; Zentner et al. 2005; Khochfar & Burkert 2006; Wet- 
zel 2011). Although the details may vary from one study to 
another, the general consensus is that orbits are typically 
close to parabolic (E ~ 0) and relatively eccentric (with 
typical circularity rj ~ 0.5 for bound orbits; Benson 2005; 
Zentner et al. 2005; Khochfar & Burkert 2006). Thus, tak- 
ing as reference r c iic(E) /r v - a = 1 (the least bound orbits 
among those explored by Boylan-Kolchin, Ma, & Quataert 
2008) and r\ — 0.5 we obtain £ m i n ~ 0.03, which we adopt as 
our fiducial minimum mass ratio. Interestingly, this value is 
close to that adopted by Fakhouri & Ma (2010) (£ = 0.04) 
to separate diffuse accretion and mergers. Therefore, in the 
terminology of Fakhouri & Ma (2010) we conclude that only 
mergers (and not diffuse accretion) contribute to the growth 
of the stellar component of a central galaxy of a halo, in the 
redshift interval considered here. 

As anticipated above, an additional and independent 
argument to exclude very minor mergers is that sufficiently 
low-mass halos are expected to be star-poor (e.g. van den 
Bosch et al. 2007; Behroozi, Conroy, & Wechsler 2010, here- 
after B10). Of course, these low-mass halos can contain sig- 
nificant amounts of gas, from which stars can form. However, 
we can neglect this effect in our pure dry-merging evolution 
scenario. Following van der Wei et al. (2009) , we account for 
the fact that low-mass halos are star-poor by assuming that 
merging halos with mass Mh < 10 11 Mq do not increase the 
stellar mass of the galaxy. The halos hosting our galaxies 
typically have log M h /M ~ 12.5 - 13 at z ~ 1.3. For these 
halos the limit corresponds to £ m i n ~ 0.01 — 0.03, i.e. slightly 
less stringent than the value £ m j n ~ 0.03 obtained from 
dynamical considerations. Therefore, we can safely adopt 
fmin = 0.03 as our fiducial value, encompassing both dy- 
namical and star formation efficiency limits. 

To conclude this section we can use the formalism in- 
troduced above to compute the mass-weighted merger mass 
ratio 



(Om = -^r 2 1 

IL FMd t 

where F M = [d 2 M h /dzd£] 
weighted merger mass ratio 



(15) 



(Mh,^,z), and the number- 



(16) 



where Fn = d 2 iV merg /d2:d£(-Mh, £, z). In our model ($)m 
and (£)jv are independent of halo mass and redshift (see 
equations 7 and 9), and only weakly dependent on £ m i n . For 
Cmin = 0.03 we get (£) M ~ 0.45 and (Q N ~ 0.21. In other 
words, if we wanted to describe the halo merging history 
simply with a single number, we could say that even though 
most mergers have typical mass ratios £ ~ 0.2, most of the 
mass is accreted in higher mass-ratio mergers, typically with 
£ ~ 0.45. 
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Figure 3. Stellar to DM mass ratio 7^.»h (upper panel) and stellar 
mass M* (lower panel) as functions of halo mass Mh according to 
prescriptions (i) (Wll, thick dotted lines), (ii) (BIO, thick solid 
lines) and (iii) (L12, thick dashed lines) for dM,/dMi,. The cor- 
responding thin lines show the estimated systematic uncertainty. 
In prescriptions (ii) and (iii) the SHMR depends on z. We plot 
here the fits for z = 1.3 based on P(M*|Mh), as described in 
Section 3.2.1. 



3.2 Stellar-to-halo mass relation (dM*/dM h ) 

3.2.1 Assigning stellar mass to halos: M*(Mh) 

In general, the relationship between galaxy stellar mass and 
host halo mass depends on both the star formation history 
and the merger history (see BIO; Guo et al. 2010). In a dry- 
merger scenario, when a halo of mass Mh undergoes a merger 
with mass ratio £ the increase in DM mass is £Mh, and the 
increase in stellar mass is 7?.*h£Mh, where 7?.*h is the ratio 
of stellar to DM mass of the satellite. As 7?.*h is expected 
to depend both on satellite mass £Mh and on redshift, in 
general we have 

dM* 



dM h 



AM, 



(£,M h ,z)=K* h (£M h ,z). 



(17) 



At the time of this writing the stellar-to-halo mass relation 
(SHMR) is uncertain, mainly as a result of corresponding 
uncertainties in stellar mass measurements, and, at higher 
redshifts, of the lack of robust galaxy samples. The total sys- 
tematic uncertainty in log M* (at fixed Mh) is approximately 
~ 0.25 at z < 1, and possibly larger at higher redshift (BIO). 
Several SHMRs are available in the literature, providing the 
relation between M* and Mh as a function of redshift. Differ- 
ences between these models can be generally accounted for 
by the systematics mentioned above. As we will show in the 
rest of the paper, this is the main source of uncertainty in 
our evolutionary models. We will thus consider three recent 
estimates of the SHMR and investigate how they affect our 
conclusions. The three prescriptions described in more de- 
tail below are based on the measurements by: (i) Wake et al. 
(2011, hereafter Wll); (ii) BIO; (iii) Leauthaud et al. (2012, 



hereafter L12). Our study will show that our conclusions are 
robust with respect to the choice of the prescription. 

Prescription (i) In the framework of halo occupation 
distribution models, Wll find that in the redshift range 
1 < z < 2 the dark-to-stellar mass ratio does not depend 
significantly on redshift. According to the best-fitting re- 
lation of Wll, the median stellar mass M* of the central 
galaxy of a halo of mass Mh is given by 



M, = 6(M h )M h , 



where 

e(M h ) 



M t 
A M 



(Mh 



exp 



(— -0 

\M h J 



(18) 



(19) 



J M, 







au = 0.8 and M t = 0.98 x 

,2\ 



with A M = 1-55 x 10 1 

10 12 /i -1 Mq (D. Wake, private communication 2 ). In Fig. 3 
we plot M* and lZ*h = M* /Mh as functions of Mh according 
to this prescription together with the systematic uncertainty 
(0.25 dex in M» at given Mh). In summary, in this case we 
assume 7?.*h(M, z) = O(M), independent of z. This first 
prescription is a useful benchmark in our analysis, because 
the interpretation of the halo and stellar mass evolution is 
straightforward when the SHMR is independent of z. How- 
ever, there are reasons to think that the SHMR actually 
depends on z also at these redshifts. In fact, we note an 
important caveat with the Wake et al. SHMR: their halo 
occupation distribution model of the clustering data makes 
the implicit assumption that the SHMR is a power-law rela- 
tion (see discussion in section 3.2 of Leauthaud et al. 2011a). 
This is problematic in light of accumulating evidence that 
the SHMR is not well described by a single power-law re- 
lation, especially at high stellar masses where it steepens 
considerably. For this reason, we expect a 10-40% difference 
between the M m i n values reported by Wll and the true 
mean halo mass (with larger errors for cri og A/, > 0.25, where 
ciogM, is the scatter in logAf, at given Mh, due to statisti- 
cal errors). An example of the difference expected between 
M m i n and the true mean halo mass is shown in Leauthaud 
et al. (2011a, see their figure 3). 

Prescription (ii) B10 provide fits to the SHMR as 
a function of both halo mass and redshift, in the range 
5s.z<4. We take the correlation between halo mass Mh 
and stellar mass M* as given in B10 (their equations 21, 22 
and 25, and columns labelled "Free(/i,K)" in their Table 2) 
to define TZ* h {M h ,z) = M»/M h . The B10 fit for z = 1.3 is 
shown in Fig. 3 with the associated systematic uncertainty 
(0.25 dex in M» at given M h ). 

Prescription (iii) Recently L12 have studied in great 
detail the SHMR as a function of halo mass and redshift at 
z<l. To obtain a third independent estimate of the SHMR 
at high redshift we extrapolate the SHMR of L12 at z> 1. 
In this case, we define 72.*h(Mh,z) = M„/Mh, where the 
correlation between Mh and M* is given by the same fitting 
formula as in B10 (their equations 21, 22 and 25), with the 
following values of the parameters: M*,o,o = 10.78, M*,o,o = 
0.36, M» ?0>a2 = 0, Mi, = 12.40, M x , a = 0.38, /3 = 0.45, 
Pa = 0.026, 5 = 0.56, 8a = 0, 70 = 0.82, 7a = 1.86. The 
L12 fit for 2 = 1.3 is also represented in Fig. 3 with the 

2 These values have been corrected by the authors after publica- 
tion of Wll. 
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Figure 4. Stellar to DM mass ratio 7?.*h (upper panel) and halo 
mass Mj, (lower panel) as functions of stellar mass M* according 
prescriptions (i) (Wll, thick dotted lines), (ii) (BIO, thick solid 
lines) and (iii) (L12. thick dashed lines) for dM./dMi,. The cor- 
responding thin lines show the estimated systematic uncertainty. 
In prescriptions (ii) and (iii) the SHMR depends on z: here we 
plot the fits for z = 1.3 based on P(M^ L \M,), which are described 
in Section 3.2.2. 



associated systematic uncertainty (0.25 dex in M* at given 
M h ). 



3.2.2 Assigning dark-matter mass to galaxies: Mh(M*) 

In Section 3.2.1 we provided prescriptions to assign stellar 
mass to halos: for this purpose, we needed to compute the 
average stellar mass at given halo mass using the probability 
distribution P(Al,\Mh). In order to build the initial condi- 
tions of our models we will also need to solve the inverse 
problem of assigning DM mass to observed galaxies of given 
stellar mass. This case is the topic of this Section. 

Here the relevant probability distribution is P(Mh|M,). 
In prescriptions (ii) and (iii) of Section 3.2.1, the relation 
between M 4 and Mh is explicitly obtained from P(M*|Mh). 
P(M h \M») is related to P(M*|M h ) by 



P(M h \M t ) 



P(M.\M h )P(M h ) 
P{M.) 



(20) 



where P(Mh) and P(M«) are the stellar and halo mass func- 
tions. The average logarithmic halo mass at given stellar 
mass is then 



(log MhXM.) 



/ P(M,\M h )P(M h )logM h dM h 
J P{M t \M h )P(M h )dM h ' 



(21) 



independent of P(M«) (see, e.g., appendix in Leauthaud et 
al. 2010). We compute (logMh}(M*) by numerically inte- 
grating the above equation, taking P(Mh) from Tinker et 
al. (2008, consistently with B10 and L12) and P(M*|M h ) 
lognormal with logarithmic mean (logM*)(Mh), given by 



prescriptions (ii) and (iii) in Section 3.2.1, and variance 
°fog m, ( z ) (dependent on redshift, independent of Mh). In 
both prescriptions (ii) and (iii) we adopt 



<y log m, 



,(*) = y/x* + S*{z), 



(22) 



where s(z) = so + s z z, with x = 0.16, so = 0.07 and 
s z = 0.05 (see B10). The derived average value of logMh 
as a function of logM* is plotted in Fig. 4 (lower panel) 
at the reference redshift z = 1.3, for both prescription (ii) 
(B10) and prescription (iii) (L12) with the expected sys- 
tematic uncertainty (0.25 dex in logM*). In the case of the 
simpler prescription (i) we just invert equation (18) to ob- 
tain the value of log Mh associated to a given value of log M* 
(dotted curves in Fig. 4). We note that the predicted values 
of TZ,h (upper panel of Fig. 4) for the relevant stellar masses 
~ lO n M are in the range -1.5 < log^» h < - 2. These 
numbers are broadly consistent within the error bars with 
a higher-redshift extrapolation of the independent estimate 
by Lagattuta et al. (2010), based on gravitational lensing. 

As shown in Figs. 3 and 4, the SHMRs of the three con- 
sidered prescriptions differ at z ~ 1.3 in both shape and 
normalization. In addition, the SHMR evolves differently 
with redshift in prescriptions (ii) and (iii), while is inde- 
pendent of redshift in prescription (i). It follows that the 
stellar mass growth rate of the same galaxy is different in 
the three models, not only because different halo masses are 
assigned to the same descendant galaxy, but also because 
different stellar masses are assigned to satellite halos of a 
given mass. Though other choices of SHMRs would also be 
possible, we limit here to the three prescriptions described 
above, because they should give a sufficient measure of the 
effect of the current uncertainty on the SHMR. For instance, 
the SHMR obtained by Moster et al. (2010) lies in between 
L12 and B10 at low redshift (see figure 10 in L12). We ver- 
ified that, within the uncertainties, this is the case also at 
higher z, at least up to the highest redshifts relevant to the 
present investigation (z ~ 2.2). 



3.3 Dry- merger driven evolution of crn and R c 

(dcro/dM, and dP c /dM») 

The final ingredient for our model is the relation between 
evolution in stellar mass and that in velocity dispersion and 
effective radius, under the assumption of purely dissipation- 
less mergers between spheroids. The evolution of the observ- 
able quantities cro and R c is expected to depend non-trivially 
on the properties of the merger history, and in particular on 
the mass ratio £ and orbital parameters of the mergers (for 
instance, orbital energy E and modulus of the orbital angu- 
lar momentum L). In general, we can write 



do-p _ 

dM» ~~ dM : 



da °(Z,E,L) and dRc 



dM„ 



dP c 
dM* 



(£,P,L). 



(23) 



In principle, these expressions can be estimated using N- 
body simulations of hierarchies of dissipationless mergers 
(e.g. Nipoti, Londrillo, & Ciotti 2003; Boylan-Kolchin, Ma, 
& Quataert 2006; N09b). However the parameter space £ — 
E — L is prohibitively large and it has not been extensively 
explored so far. As a first-order approximation, we simplify 
the treatment by neglecting the dependence on E and L, so 
that we have 
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Figure 5. Distribution of slopes a* and a*„ as functions of peri- 
centric radius (left panels) and eccentricity (right panels) for our 
set of minor-merger simulations with mass ratio £ = 0.2 and 
/3 n ~ 0.6 (see Section 3.3). The vertical bars indicate 1-ct scat- 
ter due to projection effects. The dotted horizontal lines indicate 
the analytic estimates (oj = / CT and a* R = f R ) based on equa- 
tions (28-29). 



da da , 

dMT = dE e) and 



dR c 



dR c 



(£)■ 



(24) 



In the present work we will approximate the quantities 
d<7o/dM*(£) and di? /dM»(£) with the analytic formulae 
described in Section 3.3.1, which are supported by the re- 
sults of iV-body simulations presented in Section 3.3.2. 



3.3.1 Analytic estimates 

In the simple case of parabolic orbit and negligible mass loss, 
the evolution of the virial velocity dispersion er v in a merger 
with mass ratio £ can be written (see Naab, Johansson, & 
Ostriker 2009; Oser et al. 2012) as 



U 



d In cr v 1 

d In M, ~ ~2 



1 



ln(l + fr) 



while the gravitational radius r g evolves according to 



Sn 



din r g 
dlnM, 



= 2 



ln(l + ge) 
ln(l + 



(25) 



(26) 



We defined e = aj j/aj, where er v ,a is the virial velocity dis- 
persion of the accreted system of mass £M„ . Note that the 
quantities er v and r g refer to the total (DM plus stars) dis- 
tribution of the galaxy, so the above expressions are strictly 
valid for two-component systems only if light traces mass. 
By assuming also a size-mass relation r g oc M* R , we can 
write 



1-/3 f 



so that, for fixed /3r, we obtain 



(27) 



U(0 



and 



MO 



ln(l + 



ln(l + g 2 -^) 

W+T) 



(28) 



(29) 



(see also N12). 

Assuming for simplicity oq oc ct v and R c oc r g , we obtain 



^(0 = ^U(0, so.oocM-^' 



and 
dR, 



Rc 



4M® = Wj'® 



so R e oc mI r(0 . 



(30) 



(31) 



This approach takes into account in detail the dependence 
on the merging mass ratio, but assumes only parabolic orbits 
and neglects mass-loss and structural and dynamical non- 
homology (because <to and R e are assumed proportional to 
the virial radius and gravitational radius of the total mass 
distribution). In order to model these additional complexi- 
ties it is necessary to introduce complementary information 
based on iV-body simulations. 



3.3.2 N-body simulations 

We describe here the sets of TV-body simulations of dissi- 
pationless galaxy mergers (in which the stars and DM are 
treated as distinct components) that we use to support the 
analytic estimates introduced in the previous Section. The 
results of the iV-body experiments can be parametrized by 
power-law relations between &o (or R e ) and M». We ex- 
pect that a family of merging hierarchies can be described 
by cro oc M*° , where a % is characterized by a distribution 
with mean value (a*) and standard deviation <5aJ, account- 
ing for the diversity of merging histories and the range in 
mass ratios and orbital parameters (Boylan-Kolchin, Ma, & 

Quataert 2006; N09b). Similarly we expect 3 R c oc M" R , 
with a* R distributed with mean value {a* R ) and standard 
deviation 5a* R . Numerical explorations allow us to evaluate 
how much the average virial expectation is affected by non- 
homology effects, and also to estimate the scatter around the 
average relations. N09b ran simulations of both major and 
minor mergers of spheroids, exploring extensively the pa- 
rameter space only for major mergers. Therefore we adopt 
here the results for major mergers from N09b, and we sup- 
plement them with a new set of minor-merger simulations 
(see also Nipoti 2011). 

The major-mergers hierarchies of N09b (a total of 22 
equal-mass mergers, differing in orbital energy, angular mo- 
mentum and dark-to-luminous mass ratio of the progeni- 
tors) are characterized by (a*) = 0.084, <5a* = 0.081 and 
(ct R ) — 1.00, Sa* R — 0.18, which we adopt as our fiducial 



3 The quantity a„, which measures the merging-induced varia- 
tion in log R a for given variation in log M* , must not be confused 
with [3 R , which is the logarithmic slope of the observed size-mass 
relation of ETGs. Only if a* R ~ /3 R (which in fact is not the case) 
the size-mass relation would be preserved by dry mergers (Nipoti, 
Londrillo, & Ciotti 2003). 
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values for £ ~ 1 mergers. The average values of these distri- 
butions are consistent with the predictions of equations (28- 
29), which in the case of major mergers give a* R = = 1 

and a* a = / CT (1) = 0, even though the simulations tend to 
suggest (a*) > 0, which is likely to be a consequence of 
mass loss (N09a; N09b). We note that most of the simula- 
tions in N09a have progenitors with dark-to-luminous mass 
ratio Afh/M* = 10 (model A in N09a), while only four have 
M h /M« = 49 (model D in N09a), which is expected to be 
more realistic. However, we verified that virtually the same 
values of a* R and a% reported above are found for either 
subsample. 

In order to estimate the effects of non-homology and of 
the range of orbital parameters in the case of minor merg- 
ers, we ran a new set of 13 iV-body dissipationless simu- 
lations. In these simulations we model the encounter be- 
tween a spherical galaxy with stellar mass M* and DM mass 
10M„ (specifically, model A in N09a), and a galaxy with the 
same stellar and DM distributions, with stellar mass 0.2M, 
and DM mass 2M*. The size of the less massive galaxy is 
0.36 of that of the main galaxy, so that the two galaxies lie 
on the size-stellar mass relationship R c oc r g oc MS R with 
fin ~ 0.6. The simulations were performed with the paral- 
lel iV-body code FVFPS (Fortran Version of a Fast Poisson 
Solver; Londrillo, Nipoti, & Ciotti 2003; Nipoti, Londrillo, 
& Ciotti 2003), based on the Dehnen (2002) scheme. In the 
simulations the more massive galaxy is setup as an equi- 
librium two-component system with N* ~ 2 x 10 5 stellar 
particles and Nh — 10 6 DM particles, while the satellite 
has N* ~ 4 x 10 4 and N h ~ 2 x 10 5 (DM particles are 
twice as massive as stellar particles). We verified that these 
systems do not evolve significantly when simulated in iso- 
lation. In each merging simulation, at the initial time the 
distance between the centres of mass of the two systems 
equals the sum of their virial radii. The simulations differ 
in the initial relative velocity between the two systems, i.e. 
in the values of the orbital parameters: here we use eccen- 
tricity e and pericentric radius r per i calculated in the point- 
mass approximation (see table in Nipoti 2011). Consider- 
ing the entire set of 13 simulations, e is distributed with 
(e) ~ 0.93 and Se ~ 0.10, while r per i (in units of the main- 
halo virial radius r v i r ) is distributed with (r pol -i/r v h-) ~ 0.17 
and <5(r- pC ri/Vvir) — 0.09 (for bound orbits the circularity rj is 
distributed with (77) ~ 0.53 and Srj ~ 0.12). These distribu- 
tions compare favourably with those found in cosmological 
iV-body simulations. For instance, there is good overlap be- 
tween our distributions of parameters and those found for 
halo mergers (Benson 2005; Wang et al. 2005; Zentner et al. 
2005; Khochfar & Burkert 2006; Wetzel 2011), though we 
are somewhat biased towards less bound orbits (for instance 
as compared to Wetzel 2011). However, the scatter in the 
orbital parameters of our simulations is comparable to that 
found by Wetzel (2011). 

The 13 minor-merger simulations are followed up to viri- 
alization and the structural and kinematic properties of the 
remnants (defined selecting only bound particles) are mea- 
sured as described in N09a. The values of a* R and qJ for 
these 13 simulations are plotted in Fig. 5 as functions of e 
and r pcr i/r v i r : overall, we obtain (a%) = —0.21, Sa% = 0.097 
and {a* R ) = 1.60, Sa* R = 0.36. The horizontal lines show the 
predictions of equations (28-29) for £ = 0.2 and /3 R ~ 0.6, 
which are generally consistent with the average values found 



in the simulations (with the exceptions of accretions on 
very radial orbits, i.e. small r pcr i). We note that in the 13 
minor-merging simulations we used models with relatively 
low dark-to-luminous mass ratio (Mh/M* = 10; model A 
in N09a). To assess the dependence of our results on the 
value of Mh/M», we reran two of these simulations with 
the same orbital parameters (e = 1, r pcr i = and e = 1, 
r P eri/fvir — 0.2), but using galaxy models with Mh/M* = 49 
(model D in N09a). In these cases we used N* ~ 10 5 and 
Nh ~ 2.5 x 10 6 for the main galaxy, and N* ~ 2 x 10 4 and 
Nh — 5 x 10 5 for the satellite. We found that the higher- and 
lower- Mh/M* models lead to similar values of a* and a* R , 
with differences on the angle- averaged values always smaller 
than the scatter due to projection effects. 

The fact that the numerical values of (a R ) and (o£) for 
both £ = 1 and £ = 0.2 are in good agreement with the virial 
predictions (28-29) suggests that we can use equations (30- 
31) to describe the average evolution of central velocity dis- 
persion and effective radius (see also Oser et al. 2012). Our 
numerical study also finds significant scatter in a% and in 
a* R , due to projection effects (vertical bars in Fig. 5) and on 
the range of orbital parameters. This scatter must be taken 
into account when considering the dry-merger driven evolu- 
tion of the scaling relations of ETGs (N09a; N09b; Nipoti 
2011, see also Section 5.2). 



3.4 Putting it all together 

In this Section we describe how to combine the ingredients 
discussed in the previous Sections to answer the following 
question. Given a galaxy of known stellar mass, size, and 
stellar velocity dispersion at Zd what did the progenitor at a 
higher z look like? In the following the progenitor is defined 
as the galaxy living in the most massive of the progenitor 
halos that by Zd have merged into the halo of our galaxy. 

The first step is to assign a halo mass to a descendant 
galaxy observed at redshift Zd'- once a SHMR is assumed, the 
halo mass is obtained univocally from the measured stellar 
mass using equation (21). Then, for a given halo mass at Zd, 
the evolution of the observable quantities can be obtained 
as follows. The growth in stellar mass can be written as 

d 2 M„ 



H*h(ZMh,z)ZMh- 



zdt, dzd£ 

where Mh = Mh(z) is the total mass of the halo (equation 5). 
By integrating over £ we obtain 

M h (z) 



dM, = ~AI M (z)M h (z) 



10 12 M, 



© 



{1 + zY dz, 



where 



Im{z) 



n*h(£Mh,z)^ +1 exp\^i ) 



By integrating over z we obtain 
M»{z) = M,(z d ) - AI a , M {z), 
where 

" M h (z') " 



(33) 



(34) 



(35) 



I a ,Al(z) = / Im(z') 



(1 + z')" dz' 



10 12 M Q 

The evolution of central velocity dispersion is given by 



(36) 
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Figure 6. Location of the z ~ 1.3 galaxies (circles; sample VI) 
and of the predicted z = 2.2 progenitors (squares, triangles and 
stars) in the stellar mass-halo mass plane for model W (top panel), 
model B (intermediate panel) and model L (bottom panel). For 
comparison we also plot with thick lines the z = 1.3 and z = 2.2 
fits of B10 and L12, and the (redshift-indcpcndcnt) fit by Wll. In 
each panel thin solid lines indicate the statistical scatter 0"i og M» 
of the SHMR represented by the thick solid line: in all cases we 
assume <r logMi as given by equation (22), fixing z = 2.2. 



=f4 nMtM h ,A < ?^( Z ,t,M h ), (37) 



dzd£ 



M* * dzd£ 



where Mh = Mh(z) is calculated from equation (5) and 
M* = M*(z) is calculated from equation (35). By using 
equation (9) and integrating over f we obtain 



din ctq = —AI rT {z) 



M h (z) 
M*(z) 



M h (z) 



10 12 M, 







where 



(1 + z) 71 dz, 



/ CT (0K»h(fMh,z)^ +1 exp (J ) 



Finally, by integrating over z we get 

ln Mz) = _ AI {z) 

<To(Zd) 

where 



I a ,a{z) = \ Icr(z') 



MJz') 



1O 12 M 



(1 + z')" dz'. 



(38) 



(39) 



(40) 



(41) 



Similar equations for the evolution of R B can be obtained by 
replacing <to with R c , and the subscript a with the subscript 
R in equations (37-41). 



MODEL PREDICTIONS: 
PROGENITORS 



HIGH-REDSHIFT 



We now turn to building specific realizations of our dry- 
merger evolution models and comparing them to observa- 
tional data sets. To explore model uncertainties, we first 
computed models for the following range of parameters and 
prescriptions: minimum merger mass ratio between f m i n = 
0.01 and £ m j n = 0.05; prescription for dM, / dMh (i), (ii) or 
(iii); mass-size slope /3r = 0.5 — 0.8. It turns out that the 
predicted evolution of size, velocity dispersion and stellar 
mass depends almost exclusively on the adopted prescrip- 
tion for dM»/dMh, while the other parameters have rela- 
tively little effect. Therefore we focus here on models with 
Cmin = 0.03 (see Section 3.1.3) and /3r = 0.6 (the average 
value of din i? c /d ln M* found by N12, almost independent 
of redshift). 

In order to illustrate the effects of the main uncertainty 
we show the results of three models using different prescrip- 
tions of dAft/dMh: prescription (i) for model W, prescrip- 
tion (ii) for model B, and prescription (iii) for model L (see 
Section 3.2). The choice of the model also affects how we 
assign halo masses to each of our z ~ 1.3 observed galax- 
ies. Within each model we use the corresponding prescrip- 
tion at the appropriate redshift. The rest of this Section 
is organized as follows. In Section 4.1 we describe the size, 
velocity-dispersion and mass evolution of individual galax- 
ies, presenting results obtained taking as descendant z ~ 1.3 
galaxies with measures of oa (sample VI). In Section 4.2 we 
focus on the question of the global size evolution of ETGs at 
high- ,z , taking as descendants the ETGs with no measures 
of (To (sample Rl). 
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Figure 7. Distribution of observed ETGs and model galaxies in the stellar mass- velocity dispersion (panel a), stellar mass-effective 
radius (panel b), and velocity dispersion-effective radius (panel c) planes. The models trace the evolution of the ETGs of the 1 < z < 1.6 
sample VI (circles) from the observed redshift back to z = 2.2 (triangles, squares and stars). Pentagons represent z > 1.8 observed 
ETGs of sample V2. In panel (a) the vertical dashed lines indicate the minimum stellar mass necessary to measure velocity dispersion 
at z = 1.3 and z = 2.2 with current instruments, fn each panel the solid line shows the corresponding scaling relation (with 1-cr scatter; 
dashed lines) for the massive local ETGs of the SLACS sample Auger et al. (2010). The correlation between velocity dispersion and R c 
(not reported in Auger et al. 2010) is logi? e / kpc = (1.75 ± 0.39) log(<r c 2/200kms _1 ) + 0.65 ± 0.05, with intrinsic vertical scatter 0.18 
in logii e . Here a e 2 is the velocity dispersion measured within Re/2, which we assume to be related to uq (measured within R c /S) by 
log cr = logff e 2 +0.024. 



4.1 Size, velocity-dispersion and mass evolution of 
individual galaxies 



4-1.1 Evolution in stellar mass and stellar-to-halo mass 
ratio 



We consider here results obtained taking as reference sam- 
ple VI, i.e. the ETGs at z ~ 1.3 with measured o~o. The 
results obtained for models W, B and L, applied to the 13 
descendants, are shown in Figs. 6-8. Given the small sam- 
ples with measured <ro available at the moment, this exercise 
does not yield stringent constraints on dry-merger models 
(yet). Those will be derived in the next section with the aid 
of larger samples without measures of stellar velocity dis- 
persion. However, our calculations illustrate the diagnostic 
power of large samples with measured stellar velocity dis- 
persion, which are expected to be available soon. As an aid 
to forecast the outcome of future experiments, we provide 
simple fitting formulae that describe the predicted evolution 
of detailed properties of galaxies. 



The 13 galaxies of sample VI are assigned halo masses as 
described in Section 3.2.2. By considering three different 
SHMRs we can estimate systematic uncertainties in halo 
mass for given stellar mass, including those arising from un- 
certainties in stellar mass estimates, which, for fixed IMF 
are of the order of 0.05 — 0.1 dex in the considered redshift 
range (Auger et al. 2009; Newman et al. 2012). The halo 
masses for the 13 galaxies of sample VI at the observed 
redshift are in the range 10 12 <, M h /M < 2 x 10 13 . As ex- 
pected from the curves shown in Fig. 4, halo masses tend to 
be higher in model B than in model L, while intermediate 
halo masses are predicted by model W. This is clearly seen 
in Fig. 6, where the reference galaxy models are plotted in 
the M^-M* plane as filled circles. The stellar mass evolution 
predicted by the models can be also inferred from Fig. 7 [in 



12 Nipoti et al. 



M*-o"o and M,-R c planes; panels (a) and (b)], and Fig. 8 (in 
the redshift-stellar mass plane; bottom panel). 

It is apparent that model B predicts stronger evolution 
in stellar mass than models W and L. The main reason for 
this difference is that the BIO SHMR at z > 1 is character- 
ized by low values of TZ, h = M»/M h at M» > 1O U M , with 
TZ„h decreasing for increasing mass (Fig. 4). Therefore, in 
model B M* ~ 10 11 Mq galaxies are associated with quite 
massive halos, for which the merger-driven mass-growth rate 
is found to be higher (Fakhouri, Ma, & Boylan-Kolchin 
2010). In addition, these mergers are relatively star-rich, be- 
cause of the shape of the SHMR at these high halo masses 
(Fig. 3) , which implies that these systems systematically ac- 
crete lower-mass galaxies with higher baryon fraction. Ac- 
cording to model B, stellar mass increases by factors between 
~ 1.4 (for the least massive galaxies) and ~ 2.3 (for the most 
massive) in the time span between z ~ 2.2 and z ~ 1.3 (see 
Fig. 6, intermediate panel). 

Models W and L predict significantly less evolution in 
stellar mass. In these cases the increase in M* from z ~ 2.2 
to 2 ~ 1.3 is between ~ 20% for the least massive systems 
and ~ 50% for the most massive (see Fig. 6, top and bot- 
tom panels). Even though the samples are small it is clear 
that the predicted progenitors tend to have lower M* than 
the observed galaxies (see Figs. 7 and 8). However, the dis- 
crepancy can be at least partly ascribed to selection effects: 
galaxies with M* <C 10 n Mo at z ~ 2 are too faint for 
a velocity dispersion measurement with current technology, 
while very massive galaxies might not be sampled by our 
lower redshift survey, either because they are very rare or 
because they have too low surface brightness. 

A similar tension is observed between the predicted evo- 
lution of the dark-to-luminous mass ratio 7Z.*h, an d that 
measured using abundance matching techniques. Although 
this comparison depends on the assumed SHMR, in general 
dry mergers tend to move galaxies away from the curves. 
The smaller deviation is observed for model B: in this case 
Mh is typically high compared with the SHMR, but the 
deviations are within the estimated scatter (Fig. 6, inter- 
mediate panel) . For models W and L the model progenitors 
tend to deviate from the SHMR more than the related scat- 
ter (Fig. 6, top and bottom panels). Adding star formation 
to our models would not change the overall behaviour. In 
fact, star formation only makes 72. *h increase faster with red- 
shift. Thus, the predicted positions of the progenitors in the 
M»-Mh plane (Fig. 6) would be shifted horizontally towards 
lower masses (thus reducing the deviation from the SHMR 
for models W and L, but increasing it for model B). Over- 
all, the results shown in Fig. 6 indicate that the SHMR and 
its redshift evolution are critical constraints for dry-merging 
models. Given that 72*h depends on mass, unequal mass dis- 
sipationless merging moves galaxies in a non trivial manner 
in the TZ^-M^ plane, in general away from the redshift de- 
pendent SHMR. A potential caveat is the SHMR is derived 
for all galaxies, not just ETGs. However, in the range of 
masses considered here the vast majority of central galaxies 
are indeed ETGs, and therefore this is not a concern. 

4-1.2 Evolution in velocity dispersion 

A galaxy undergoing a dry merger with a lower velocity- 
dispersion system is expected to decrease its velocity dis- 
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Figure 8. Predicted redshift evolution of central velocity disper- 
sion (to (top panel), effective radius R c (intermediate panel) and 
stellar mass Af* (bottom panel) for the ETGs of sample VI (cir- 
cles), according to the three different models (thin curves). The 
corresponding thick curves indicate the average values (log <jq ) , 
(logi? c ) and (logM*) as functions of z. Empty pentagons repre- 
sent ETGs observed at z > 1.8 (sample V2). 



persion (Nipoti, Londrillo, & Ciotti 2003; Naab, Johansson, 
& Ostriker 2009). For this reason our predicted z ~ 2.2 
progenitors tend to have higher ao than their z ~ 1.3 de- 
scendants (see top panel in Fig. 8, and panels a and c in 
Fig. 7). However, the effect is small. In the case of models 
W and L the variation in ao is < 5%. The more strongly 
evolving model B predicts variations up to ~ 15%. 

The combination of this weak change in ao and of the 
significant variation in stellar mass leads to predicted z ~ 2.2 
progenitors with substantially larger ao than local ETGs 
with similar stellar mass [Fig. 7, panel (a)]. At the mo- 
ment the reference sample of z > 1.8 ETGs with measured 
ao (sample V2) consists of only 4 galaxies. Three of them 
have M* > 1.5 x 10 1 Mq and cannot be dry-merging pro- 
genitors of our ETGs. The fourth galaxy (the least massive, 
with logAf./Mo = 10.85) appears to lie on the local M t - 
ao relation, with lower ao than all our model progenitors. 
Lower mass galaxies are below the current limits. 

We conclude by emphasizing that a strong prediction of 
the dry-merger model is that there should be a population 
of galaxies with high (~ 300 km s -1 ) stellar velocity disper- 
sion and stellar mass in the range 10.5 < log M*/Mq < 11. 
This prediction should be testable in the near future. In 
the short term, sensitive multiplexed near infrared spectro- 
graphs about to be commissioned on large telescopes [e.g. 
the Multi-Object Spectrometer for Infra-Red Exploration 
(MOSFIRE) on Keck; McLean et al. 2010] will be able to 
provide such samples at z > 1.5, where CaH&K and the 
Gband region are redshifted into the Y and J bands. In the 
longer term, the Near-Infrared Spectrograph (NIRSPEC) 
on the James Webb Space Telescope will be able to ex- 
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Table 2. Parameters of the best-fitting linear correlations 
{log M*/Mq) = a M Iog(l + z) + 6 M , (lo gj R c /kpc) = a fl log(l + 
z) + b R and (log cr / km s -1 } = a a log(l + 2) + b CT . 



Model 


I'M 




11 R 


b R 




b a 


W 


-0.67 


11.23 


-0.80 


0.61 


0.065 


2.40 


B 


-1.48 


11.52 


-1.93 


1.01 


0.217 


2.35 


L 


-0.60 


11.21 


-0.71 


0.58 


0.056 


2.41 



The fits represent the average evolution over the redshift 
interval 1 < 2 < 2.5 of the 13 ETGs of sample VI (thick curves in 
Fig. 8), according to models W, B and L. 



tend velocity dispersion measurements to fainter galaxies 
and higher redshifts. 



4-1.3 Evolution in size 

We discuss here the predicted evolution in size and in the 
size-mass relation for ETGs of sample VI. As expected, all 
models predict progenitors more compact than the descen- 
dants. Typically the relative variation in size is larger for 
more massive galaxies (see also Oser et al. 2010). As for 
other observables, the size evolution is stronger in model B 
than in models W and L [see panels (b) and (c) in Fig. 7, 
and intermediate panel in Fig. 8]. Depending on the mass 
and redshift of the descendant, model B predicts an increase 
in R e of a factor of 1.3 — 2.8 from z ~ 2.2 to z ~ 1.3, while in 
the same redshift range models W and L predict at most a 
factor of ~ 1.6 increase in R e . Given the smallness and het- 
erogeneity of our reference higher-z sample V2, we cannot 
draw quantitative conclusion on the size evolution consid- 
ering only galaxies with measured velocity dispersion. We 
defer the comparison of predicted and observed size evo- 
lution to Section 4.2, in which we will consider the larger 
samples Rl and R2. 



4-1-4 Describing the evolution of M,, R e and 00 

In Fig. 8, together with the evolutionary tracks of the in- 
dividual galaxies of sample VI, we plot also, as functions 
of redshift, the corresponding average quantities (logM*), 
(logJ? c ) and (log (Jo). For convenience we provide linear 
fits to the average evolution in Table 2. These fits can 
be used to estimate the stellar-mass, size, and velocity- 
dispersion evolution predicted by our models for a typical 
massive ETG in the redshift range l<z<2.5. In particu- 
lar, we parametrize the evolution of the three observables 
as M, tx (1 + z) aM , Re oc (1 + z) aR and a oc (1 + z) a " : 
considering the three models, the power-law indices lie in 
the following ranges: —1.5 < sm < — 0.6, —1.9 < an ^ — 0.7 
and 0.06 < a a < 0.22. Combining the predicted mass and 
size evolution, we find that the effective stellar-mass surface 
density (which measures galaxy compactness) is predicted 
to evolve as M*/R^ oc (1 + z) 0,8-2 ' 4 in the redshift range 
1 <z<2.5. 



1 ^Observed ((z) = 2.2) 




10.5 11 10.5 11 



log M,/M log M,/M 

Figure 9. Distribution in the M*-R c plane of galaxies observed 
at 2 > 2 (empty pentagons; sample R2) and of 2 = 2.2 progenitors 
predicted by our models for the 150 descendant quiescent galaxies 
at 1 < 2 < 1.6 (sample Rl), for the three choices of SHMR 
(W,B,L). Symbols are the same as in Fig. 7. In each panel the 
solid line indicates the best-fit to the observed 2 > 2 data, while 
the dashed lines indicate the associated observed scatter. 
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Figure 10. Cumulative distributions of the offset in logiJ e from 
local (SLACS; Auger et al. 2010) Re-M* relationship for observed 
galaxies at 2 = 2 — 2.6 (sample R2) and for 2 = 2.2 progenitors 
predicted by models W, B and L for sample Rl. For the mod- 
els the distributions are computed considering only galaxies with 
with M* > 10 10,45 Mq (i.e. adopting the same cut in stellar mass 
as for the observed sample). 
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4.2 Global size evolution of early-type galaxies 

In this section we apply our models to predict the progeni- 
tors of sample Rl, i.e. 150 quiescent galaxies with 1 < z < 
1.6. 

Figure 9 shows the progenitors of sample Rl in the M*- 
R c plane, together with the observed population of quiescent 
galaxies at 2 < z < 2.6 (sample R2). In the same diagram we 
show the best-fit to the sample R2 data log R c / kpc = 0.14+ 

0. 59(log M*/Mq - 11), with observed scatter 5 log Re = 0.23 
at given M*. In all cases, the model progenitors populate 
mostly the region above the stellar mass-size relation, while 
there are no massive progenitors as compact as some very 
dense ETGs observed at z > 2. It is apparent that all mod- 
els tend to predict progenitors with lower mass than the 
observed population at z > 2. However, in all models there 
is a significant number of objects with stellar mass in the 
range 10.45 < log M,/Mq < 11.5 spanned by the observed 
ETGs. 

In order to quantify the difference between the predicted 
progenitors and the observed high- z galaxies, we therefore 
select model progenitors with log M,/Mq > 10.45 and com- 
pute for each of them the vertical (i.e. in log R e at fixed 
M») offset Alogi? c with respect to the local [Sloan Lens 
ACS Survey (SLACS); Auger et al. 2010] M*-R c correla- 
tion logi? e /kpc = 0. 81 (log M,/M Q - 11) + 0.53. For com- 
parison, we compute the same quantity for the ETGs ob- 
served at z>2. The parameter Alog\R c is a normalized 
measure of compactness. By construction, normal (local) 
ETGs have A log R c distributed around zero. Negative val- 
ues of A log R e indicate galaxies more compact than av- 
erage. The cumulative distributions of the vertical offset 
Alog\R c , shown in Fig. 10, clearly indicate that the pre- 
dicted progenitors are more dense than local galaxies (me- 
dian A log R c ~ —0.1, i.e. -R c /i?o, local ~ 0.8), but not as com- 
pact as observed (z>2) galaxies (median Alog_R e ~ —0.4, 

1. e. R e /Re,iocai ~ 0.4). The progenitors tend to be more com- 
pact in model B than in model W and L, but definitely not 
enough to match the observed galaxies. In all cases, it is clear 
that the the model progenitors and the observed galaxies do 
not belong to the same population (probability< 10~ 7 based 
on a Kolmogorov-Smirnov test). 

Figure 11 illustrates the distribution of progenitors of 
sample Rl in the M*-Mh plane. This analysis confirms and 
strengthens the results of the analysis of the smaller sample 
VI described in Section 4.1.1. The high-z progenitors pre- 
dicted by dry-merging models deviate substantially from the 
SHMR at the corresponding redshift. Only in model B the 
the discrepancy is marginally consistent with the scatter of 
the SHMR. 

Our findings suggest that a ACDM-based pure dry- 
merging model cannot explain the observation of ultra- 
compact massive quiescent galaxies at z > 2. The discrep- 
ancy cannot be reduced by dissipative effects, which work in 
the opposite direction. Furthermore, even though the SHMR 
is quite uncertain at these redshifts, our results are robust 
and hold for all three SHMRs that we have tested here. The 
underlying physical reason is that in a pure dry-merging 
model fast evolution in size is necessarily associated with 
fast evolution in stellar mass. Therefore, if the progenitors 
of z ~ 1.3 galaxies are forced to be as dense as the observed 
galaxies at z ~ 2.2 they cannot be as massive. 
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Figure 11. Location of the predicted z = 2.2 progenitors 
(squares, triangles and stars) in the stellar mass-halo mass plane 
for model W (top panel), model B (intermediate panel) and model 
L (bottom panel) of the 150 observed galaxies at 1 < z < 1.6 
(sample Rl). For comparison we also plot with thick lines the 
z = 2.2 fits of B10 and L12, and the (redshift-independent) fit by 
Wll. Thin solid lines indicate the statistical scatter <?i og m« °f the 
SHMR: in all cases we assume &i os m, a s given by equation (22), 
fixing z = 2.2. 
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Figure 12. Same as Fig. 7, showing the future evolution of the z ~ 1.3 ETGs (sample VI) to z = 0.19 (the median redshift of the 
SLACS sample) for models L and B. The symbols are the same as in Fig. 7. 



5 CHECKING MODEL PREDICTIONS: 
LOW- REDSHIFT DESCENDANTS 

The main focus of this paper is the evolution of ETGs in the 
relatively short time span (~ 1.8 Gyr) between z ~ 1.3 and 
z ~ 2.2, in which most of the size evolution of ETGs appears 
to happen. We have demonstrated that ACDM-based dry- 
merger models have difficulties producing a fast enough size 
evolution in this redshift range. However it is important to 
perform a consistency check and compare our predictions 
with the milder size evolution observed between z ~ 1.3 
and 2 ~ 0. We consider only the evolution of sample VI, 
taking advantage of the diagnostic power of stellar velocity 
dispersion measurements. 

In order to extend our models to z ~ we need the 
SHMR at z < 1. For this reason we restrict our analysis to 
models B and L, for which the SHMR is well measured in 
this redshift range (BIO; L12, see Section 3.2.1). We leave 
all other model parameters unchanged. A potential concern 
is that the arguments used in Section 3.1.3 to constrain the 
value of £ m i n between z ~ 2.2 and z ~ 1.3 do not necessarily 
apply to the longer time span between z ~ 1.3 and z ~ 0. 
However, we verified empirically that the predicted evolu- 
tion from z ~ 1.3 to z ~ does not depend significantly 
on the specific choice of £ m i n . In addition, we recall that we 
are assuming that our ETGs remain central halo galaxies as 



they evolve. While this is appropriate for massive galaxies at 
z > 1, at z <C 1 some of them might become satellite galax- 
ies in clusters. However, this is a minor effect, since even 
in the local Universe the vast majority of massive galaxies 
(M» >1O 11 M0 are believed to be central (see Section 3.1). 
We conclude that an extension of our models down to z ~ 
is sufficiently accurate for our purposes. 

5.1 Predicted properties of z ~ descendants 

The location of the low-redshift model descendants of sample 
VI in the M,-R c -ao space is shown in Fig. 12. For compari- 
son, the observed local (SLACS) correlations are plotted in 
Fig. 12. For consistency, we have computed the evolution of 
model galaxies until z = 0.19, the median redshift of the 
SLACS sample (Auger et al. 2009). 

The low-redshift descendants are found relatively close 
to the local observed correlation, albeit with considerable 
scatter (see Section 5.2). As for the higher redshift interval, 
model B predicts faster evolution than model L. In partic- 
ular, we note that model B tends to "overshoot" the local 
M»-ffo relationship, predicting massive descendants with ve- 
locity dispersion generally lower than that of observed local 
ETGs of similar mass, while the local descendants predicted 
by model L have a consistent with observations [panel (a) 
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cuss in the next section, explaining the tightness of the local 
scaling relations is a much more formidable challenge. 
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Figure 13. Same as Fig. 10, but for the z = 0.19 predicted 
descendants of the z ~ 1.3 ETGs (sample VI) for models L and 
B, and, for comparison, for the observed ETGs of the SLACS 
sample ((z) = 0.19; Auger et al. 2009). 



in Fig. 12]. tn contrast, model B performs somewhat better 
than model L when compared with the local M*-R e rela- 
tion [panel (b) in Fig. 12], though in neither case the re- 
sults are very satisfactory. This is shown quantitatively by 
Fig. 13, plotting the cumulative distributions of the vertical 
offset Alogi? c from the local M*-i? c relation (introduced 
in Section 4.2) for the model z — 0.19 descendants and 
for the observed SLACS galaxies. Not only the descendants 
tend to be, on average, too compact (the median offset is 

A log .Re < 0.07 for model B and Alogi? e < 0.15 for 

model L), but also their distribution in the M*-R c plane is 
characterized by quite large scatter (the predicted cumula- 
tive distributions are much shallower than the observed one; 
see Fig. 13). According to a Kolmogorov-Smirnov test, the 
probability that the model descendants and the observed 
galaxies belong to the same population is 0.1 for model B 
and 0.005 for model L. 

It is also instructive to study the location of the descen- 
dants in the M»-Mh plane, shown in Fig. 14. The z = 0.f9 
fit of the corresponding model is plotted for comparison, 
along with the z = 1.3 fit. The z — 0.19 descendants tend to 
have halo masses that are lower than those predicted by the 
corresponding SHMR. The most massive galaxies tend to 
deviate more from the SHMR, but in all cases the discrep- 
ancy is within the estimated scatter on the observationally 
determined SHMR (BfO, Lf2). As discussed previously, star 
formation would make the discrepancy larger, which sug- 
gests that, within the context of a ACDM Universe, dissi- 
pative mergers cannot have contributed much to the growth 
of ETGs at z< 1. 

We conclude that the relatively mild average evolution 
of ETGs between z ~ f .3 and z ~ is marginally consistent 
with a ACDM-based dry-merger model. However, as we dis- 



5.2 Scatter in the scaling laws 

ft is well known that the local observed scaling relations 
of ETGs are remarkably tight. The existence of these scal- 
ing laws and their tightness represent a severe challenge for 
any theory of galaxy formation. For example, it has been 
shown that it is hard to bring ETGs onto the local scal- 
ing laws (within their small scatter) via a stochastic growth 
process such as merging (Nipoti, Londrillo, & Ciotti 2003; 
Ciotti, Lanzoni, & Volonteri 2007; Nair, van den Bergh, & 
Abraham 201f , N09a; N09b). In this paper we have assumed 
that every ETG evolves according to the expected average 
growth history. In this way, we have so far neglected several 
sources of scatter in the properties of progenitor or descen- 
dant galaxies. In other words, two identical ETGs at a given 
redshift are predicted by our models to have identical pro- 
genitors and identical descendants. This is clearly not real- 
istic, because we expect a distribution of merging histories. 
An additional source of scatter is the intrinsic scatter of the 
SHMR that we adopt to match stars and halos. Finally, the 
distribution of merger orbital parameters adds scatter to the 
distribution of the slopes ct* R and a% characterizing the evo- 
lution of Re and <ro during an individual merger event (see 
Section 3.3). 

These additional sources of scatter are clearly a prob- 
lem. The size-mass-velocity dispersion correlations of our 
z ~ model descendants are already characterized by a 
substantial spread (see Figs. 12-13), even neglecting these 
effects, fn part, the spread might reflect observational un- 
certainties in the data. However, this is a small effect. N12 
recently showed that the observed scatter of the M*-R c re- 
lation does not increases significantly with redshift in the 
range 0.4 < z < 2.5. Therefore, unless there is some form 
of fine tuning or conspiracy, we expect that inclusion of 
the aforementioned sources of intrinsic scatter would lead 
to even larger spread. 

Consider for example, the expected scatter in da/dM, 
and dR c /dM t due to the range of merging orbital param- 
eters. By combining the simulations of N09a with the set 
of minor-merging simulations presented in Section 3.3, we 
find that the tightness of the local M,-R c implies that lo- 
cal massive ETGs can have assembled at most ~ 45% of 
their stellar mass via dry mergers during their entire merger 
history. This is an upper limit, under extreme fine tuning 
(see Nipoti 2011, for details). For comparison, our cosmo- 
logically motivated models predict z ~ descendant ETGs 
to have assembled ~ 50 - 60% (B) and ~ 40 - 50% (L) of 
their stellar mass via dry mergers since z = f.3 [see panels 
(a) and (b) in Fig. 12]. This is higher than the maximum 
limit for extreme fine tuning. Taking into account the addi- 
tional scatter in the SHMR and in the merging history would 
only exacerbate the problem. This result, based on cosmo- 
logically motivated merger histories, extends and supersedes 
that obtained by N09b under more idealized conditions. 
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Figure 14. Same as Fig. 6, but showing the future evolution of 
the reference z ~ 1.3 ETGs (sample VI) to z = 0.19 for model 
B (upper panel) and model L (lower panel). The symbols are the 
same as in Fig. 7. Thin solid lines indicate the statistical scatter 
"logM, of the z = 0.19 SHMR: for model B we take 0i O gM* as 
given by equation (22) with z = 0.19; for model L we assume 
°logM, = 0.2 (see L12). 



6 DISCUSSION 

We have developed dry-merging evolution models of ETGs 
based on cosmologically determined merger rates and cali- 
brated on iV-body simulations of individual mergers between 
spheroids. This hybrid strategy allowed us to compute ac- 
curately observables such as size, stellar velocity dispersion 
and mass, and their evolution within a cosmological con- 
text. Dissipative effects were neglected, so as to maximize 
the predicted decrease in density with time. This conserva- 
tive approach allowed us to draw general conclusions on the 
ability of ACDM merging models to reproduce the observed 
size evolution. 

The predictions of our models were tested by consider- 
ing two well defined samples of ETGs at z ~ 1.3, computing 
the predicted properties of their progenitors at z ~ 2.2 and 
comparing them to those of real observed galaxies. As an 



additional check, we have tested our predictions against the 
local scaling laws of ETGs. 

Our main finding is that the size evolution of massive 
ETGs from z > 2 to z ~ 1.3 cannot be explained exclusively 
by dissipationless major and minor merging. This result is 
robust with respect to uncertainties in the correlation be- 
tween stellar and halo mass at z> 1. Intuitively and quali- 
tatively, the main motivation is that size growth is coupled 
to mass growth even in minor mergers. Therefore, substan- 
tial size growth also requires significant mass growth, more 
than the evolution in the stellar mass function would allow. 
Furthermore, significant size growth requires several merg- 
ers and increased scatter in the scaling relations, larger than 
their tightness in the local Universe would allow. 

In addition to the evolution in stellar mass, size, and 
stellar velocity dispersion of ETGs, we studied the redshift 
evolution of their dark-to-luminous mass ratio under the 
same dry-merging scenario. A comparison of the predicted 
evolution with the measured one shows a similar tension 
between theory and data. Dry mergers tend to move galax- 
ies away from the observed SHMRs, suggesting, e.g., that 
a pure dry-merging scenario is inconsistent with a redshift- 
independent SHMR at z>l. Even though more accurate 
measurements of the SHMR are needed to draw strong con- 
clusions, it is clear that this is a promising observational 
diagnostic tool of dry-merger models. 

One important caveat to our analysis is that we as- 
sume that the progenitors of local or intermediate redshift 
ETGs are also spheroids. Theoretically it is possible that 
they might be disc-dominated (see, e.g., Feldmann et al. 
2010). Observationally, it is not clear whether this assump- 
tion is justified, since the morphology of high- z massive com- 
pact galaxies is not always well determined and they might 
include a large fraction of disc-dominated systems (van der 
Wei et al. 2011; Weinzirl et al. 2011). Conversely, it is also 
possible that the present-day descendants of z > 2 ETGs 
might be the bulges of massive disc galaxies (Graham 2011). 
The key question is how much are results changed if we allow 
for morphological transformations. A quantitative answer to 
this question would require numerical investigation beyond 
the scope of this paper. Qualitatively, the strict coupling be- 
tween mass and size evolution ultimately comes from energy 
conservation. Therefore it should hold independently of the 
morphology of the merging galaxies. 

Throughout the paper we have also assumed that dur- 
ing a merger the accreted system is a spheroidal galaxy ly- 
ing on the observed size-mass relation of ETGs (R c oc M? R 
with /3r ~ 0.6). In principle, it is possible that a substan- 
tial fraction of the accreted satellites are low-surface density 
disc galaxies, which do not form stars efficiently and deposit 
most of their stellar mass in the outskirts of the main galaxy. 
This might be a more efficient mechanism to increase galaxy 
size for given increase in stellar mass. Ad hoc numerical sim- 
ulations would be required to assess the possible effect of 
this process quantitatively: to zero-th order approximation 
such an effect can be implemented in our model by forcing 
a value of f3n smaller than observed for ETGs which im- 
plies stronger size evolution (see equation 29). However, as 
pointed out above, it turns out that varying /3r has a rela- 
tively small effect on the predicted size evolution, which is 
not sufficient to reconcile the models with the observations. 

Our findings suggest that the ultra-dense high- z ETGs 
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might be an anomaly even in a hierarchical ACDM universe 
in which most mergers are dry. In principle, this might be 
indicating that the actual dry-merger rate is higher than 
predicted by the considered ACDM model (for instance, 
because the cosmological parameters are substantially dif- 
ferent from what we assume; see also Section 3.1.2). To test 
this hypothesis we can compare the merger rate of our mod- 
els with the merger rate inferred from observations of galaxy 
pairs. For instance, N12, considering mergers with mass ra- 
tios > 0.1, find that in the redshift range 1.5 < z < 2 the typ- 
ical merger rate per galaxy is dN melg /dt = 0.18±0.06/r (for 
observed quiescent galaxies with M* > 10 10,4 Mo), where 
r = 1 — 2 Gyr is the merging time. Adopting the same cut 
in stellar mass, merger mass ratio and redshift, we find, on 
average, dNmerg/dt ~ 0.22 Gyr -1 (model W), dAWg/di ~ 
0.4 Gyr" 1 (model B) and dN mclg /dt ~ 0.17 Gyr" 1 (model 
L), taking as descendant sample Rl. This means that in fact 
the model merger rates tend to be higher than those esti- 
mated observationally, so it is unlikely that the difficulties 
of ACDM dry-merger models are due to an underestimate 
of the merger rate. 

Alternatively, the tension between the data and the 
model might be alleviated if there are other physical pro- 
cesses, not included in our models, that contribute to make 
galaxies less compact with evolving cosmic time. An inter- 
esting proposal is expansion due to gas loss following feed- 
back from AGN (Fan et al. 2008, 2010), which, in principle, 
could naturally explain the observation that most of the size 
evolution occurs at higher redshift, when AGN feedback is 
believed to be most effective. However, no satisfactory fully 
self-consistent model of size evolution via AGN feedback has 
been proposed so far and it is not clear whether it can be 
a viable solution. In particular, it appears hard to recon- 
cile this scenario with the relatively old stellar populations 
of the observed compact high-z ETGs, because the char- 
acteristic timescale of expansion due to AGN-driven mass 
loss is so short that the galaxy is expected to have already 
expanded when it appears quiescent (Ragone-Figueroa & 
Granato 2011). Otherwise, it is possible that observations 
are affected by systematics or selection biases which maybe 
not fully understood (Hopkins et al. 2010a; Mancini et al. 
2010; Oser et al. 2012). 

7 CONCLUSIONS 

The goal of this paper was to investigate whether dry merg- 
ing alone is sufficient to explain the observed size evolution 
of elliptical galaxies from z > 2 to the present. We focused 
primarily on the short ~ 1.8 Gyr time span between z ~ 2.2 
and z ~ 1.3 when much of the size evolution appears to 
take place. We find that the observed size evolution is in 
fact stronger than predicted by ACDM dry-merging mod- 
els. Quantitatively, our main results can be summarized as 
follows: 

(i) According to our ACDM-based pure dry-merging 
models, at redshifts 1 < z < 2.5 a typical massive (M* ~ 
IO^Mq) ETG is expected to evolve in stellar mass as M* oc 
(1 + z) aM , size as R c oc (1 + z) aR and velocity dispersion as 
cr oc (l + z) a °, with -1.5<a M < -0.6, -1.9<a fl < -0.7 
and 0.06 <a CT <0.22; the corresponding evolution in stellar- 
mass surface density is M*/RZ oc (1 + z)°' s ~ 2 ' 4 . 



(ii) The predicted z > 2 dry-merger progenitors of z ~ 1.3 
massive ETGs are, on average, less massive and less compact 
than the real massive quiescent galaxies observed at similar 
redshifts. The median offset from the local M,-R B relation- 
ship is Alog_R c ~ —0.1 dex (i.e. R e /R e ,iocai ~ 0.8) for model 
progenitors, and A log R c ~ —0.4 dex (i.e. R e /R e , local ~ 0.4) 
for observed high-z galaxies, i.e. the latter are smaller in size 
by a factor of ~ 2 at given stellar mass. 

(iii) Dry mergers introduce substantial scatter in the scal- 
ing relations of ETGs. Even models that reproduce the av- 
erage size evolution from z<1.3 to z ~ require extreme 
fine tuning to be consistent with the small scatter of the 
local scaling laws. For instance, our ACDM-based models 
predict that local massive ETGs have accreted ~ 40 — 60% 
of their stellar mass via dry mergers since z ~ 1.3. How- 
ever, the tightness of the local R c -M, relation implies that 
these ETGs can have accreted in this way at most ~ 45% 
of their stellar mass over their entire assembly history (with 
extreme fine tuning; see also Nipoti 2011; Nipoti, Treu, & 
Bolton 2009a; Nipoti et al. 2009b). 

Our conclusion is thus that dry mergers alone, whether 
minor or major, are insufficient to explain the observed 
growth of massive galaxies. This is in good agreement with 
the results of several studies, including that by N12 and 
those of Fan et al. (2010); Shankar et al. (2011). It is in- 
teresting to compare in particular with the results by N12, 
which are based on the same dataset, augmented by number 
density considerations, but a completely different analysis. 
N12 show that the observed number of merging satellites is 
insufficient to cause sufficient evolution, while we show that 
the theoretically predicted rates are insufficient. Given the 
completely different analysis and different systematic un- 
certainties it is encouraging that the results are mutually 
consistent. 
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